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Abstract  -  Acoustic  Echo  Cancellation  (AEC)  often  involves 
adaptive  filters  with  large  numbers  of  taps,  which  results  in 
poor  performance  in  real-time  applications.  The  utilization  of 
Delayless  Suhhand  Adaptive  Eilter  (DSAE)  helps  reduce  com¬ 
putations  and  improve  the  overall  performance.  However, 
conventional  oversampled  suhhand  adaptive  filters  mainly  use 
DET  or  GDET  based  analysis/synthesis  filter  banks  and  gener¬ 
ate  “complex-valued”  subband  signals.  This  is  particularly 
inefficient  when  applying  the  Affine  Projection  Algorithm 
(APA),  a  popular  adaptive  algorithm  for  AEC  problem,  to  each 
subband.  Eor  APA  implementation,  real- valued  signals  show 
higher  efficiency  than  complex  signals.  In  this  paper,  we  pre¬ 
sent  a  real-valued  delayless  subband  APA  and  study  both  its 
computational  complexity  and  performance  on  AEC  problems. 
Compared  to  the  complex  valued  approach,  our  method 
achieves  a  better  performance  with  lower  computational  cost. 


In  particular,  we  generate  the  real-valued  subband  signals  by 
applying  Single  Side  Band  (SSB)  modulation  [3]  on  the 
complex  GDFT  filter  banks.  The  real- valued  signals  simplify 
the  implementation  of  APA  on  each  subband  and  reduce  the 
computational  complexity  of  the  overall  AEC  systems.  Fur¬ 
thermore,  the  use  of  delayless  SAP  eliminates  the  inherent 
delay  existed  in  SAP  systems.  As  such,  our  system  is  more 
efficient  and  well-suited  for  real-time  AEC  applications. 

This  paper  is  organized  as  follows:  in  Section  II,  we  de¬ 
scribe  the  real-valued  DSAE  structure  and  the  implementa¬ 
tion  of  APA  algorithm.  Section  III  presents  the  complexity 
analysis  of  our  proposed  method  and  compares  it  with  that  of 
complex-valued  DSAE  using  APA  algorithm.  Finally,  we 
present  experimental  results  to  demonstrate  the  efficiency  of 
our  method  in  Section  IV. 


I.  Introduction 


II.System  architecture 


Affine  Projection  Algorithm  (APA)  is  widely  used  in 
Acoustic  Echo  Cancellation  (AEC)  applications  due  to  its 
improved  performance  compared  to  LMS  algorithms  and 
lower  computational  complexity  than  RLS  algorithms.  How¬ 
ever,  a  real  acoustic  echo  canceller  often  involves  adaptive 
filters  with  large  numbers  of  taps,  which  results  in  heavy 
computational  burden  and  slow  convergence  speed.  The 
large  dynamic  range  of  audio  signal  further  deteriorates  its 
performance.  Hence,  APA  is  often  combined  with  Subband 
Adaptive  Filters  (SAF)  to  save  computational  cost  and  accel¬ 
erate  convergence  speed  [  1  ] . 

A  problem  with  SAF  is  that  a  substantial  delay  is  intro¬ 
duced  into  the  signal  path  by  analysis/synthesis  filter  banks. 
This  is  particularly  undesirable  in  real-time  AEC  applica¬ 
tions.  Morgan  and  Thi  [2]  proposed  Delayless  Subband 
Adaptive  Filtering  (DSAE)  to  overcome  this  problem.  How¬ 
ever,  conventional  DSAE  usually  uses  DET  or  GDFT  based 
analysis/synthesis  filter  banks  [3]  [3],  in  which  the  generated 
subband  signals  are  “complex-valued”.  This  introduces  addi¬ 
tional  complexity  when  combining  the  APA  algorithm  with 
the  DSAE  approach  in  AEC,  as  implementing  APA  with 
complex-valued  signals  is  more  complicated  than  with  real¬ 
valued  signals. 

In  this  paper,  we  present  a  real-valued  oversampled  de¬ 
layless  subband  APA  based  on  GDFT  analysis  filter  banks. 


A.  Delayless  SAF  Structure 

Fig.l  shows  the  overall  system  structure  of  a  real-valued 
Delayless  Subband  Acoustic  Echo  Canceller  based  on  APA. 


Fig.  1 :  Real-valued  Delayless  Subband  APA  AEC  system 
Position  A:  Open-Loop;  Position  B:  Closed-Loop 

In  contrast  to  the  conventional  SAF  structure  which  re¬ 
quires  both  analysis  and  synthesis  filters.  Delayless  SAF 
avoids  using  synthesis  filter  banks  by  employing  a  weight 
transformation  technique  to  transform  the  subband  filter 
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weights  into  fullband  filter  weights.  Thus  the  transmission 
delay  can  be  eliminated  due  to  the  fact  that  the  signal  for 
canceling  the  desired  signal  is  computed  by  the  fullband 
filter.  The  Delayless  SAP  can  be  divided  into  two  categories: 
open-loop  type  (Position  A  in  Fig.l),  and  closed-loop  type 
(Position  B  in  Fig.l).  Generally  speaking,  open-loop  scheme 
will  give  less  suppression  since  the  algorithm  is  working 
blindly  with  respect  to  the  real  error  signal.  In  this  paper, 
both  of  these  two  schemes  are  implemented  and  studied. 


B.  Single-Sideband  (SSB)  Analysis  Filter  Banks 

Since  the  implementation  of  APA  is  easier  with  non¬ 
complex  arithmetic,  real-valued  subband  signals  are  pre¬ 
ferred  on  each  subband.  To  obtain  real-valued  oversampled 
filter  banks,  we  can  use  either  non-uniform  filter  banks  [4]  or 
SSB  modulated  filter  banks  [3].  The  former  scheme  typically 
needs  to  deal  with  different  subsampling  rates.  Therefore,  we 
focus  on  SSB  filter  banks  implementation  here. 

It  is  known  that  SSB  filter  banks  can  be  obtained  by 
postprocessing  complex-valued  Generalized  DFT  (GDFT) 
filter  banks  properly  [3].  Below  we  first  describe  GDFT 
briefly.  The  GDFT  transform  pair  is  defined  as: 

Ygdft  ^  ^  ^  ^  ^  0,1,....,  A  - 1 

n=0  (1) 

y{n)  =  „  =  0,1,...., /s:  - 1 

where  K  is  the  number  of  subbands,  and  Wf.  —  .  The 

analysis  filter  bank  equation  based  on  GDFT  is  then: 

CO 

A,®'""'  (m)  =  Y,  KmN  -  (2) 

n=-Go 

k  =  0X-;K-l 

where  h(n)  is  the  lowpass  prototype  filter  for  analysis  filter 
banks,  N  is  the  decimation  factor,  and  Lp  is  the  length  of 
prototype  filter  h(n).  In  this  paper,  we  only  discuss  the  case 
for  ko=l/2,  no=-(Lp-l)/2.  Therefore,  SSB  signal  Af™(OT)  can 

be  expressed  in  terms  of  GDFT  signal  as: 

Af®(m)  =  (3) 

where  coj  =  k/N  is  the  bandwidth  of  the  prototype  h(n).  Ob¬ 
viously,  the  decimation  factor  for  real-valued  SSB  filter 
banks  is  A  =  k/co^-  Compared  to  the  decimation  factor  for  a 
complex-valued  GDFT  filter  banks,  Nc  =  2ii/(Oa  ,  we  can 
conclude  that  N  is  approximately  half  of  i.e.  N  ~  Nc/2  , 
because  of  the  real  operation  in  SSB. 

Because  the  prototype  filter  h(n)  design  is  identical  to  the 
design  for  complex-valued  GDFT  filter  banks,  we  used  the 
fast  converging  iterative  least  squares  method  provided  by 
Weiss  [5]  to  design  h(n).  This  method  has  the  advantages  of 
sufficiently  suppressing  the  aliasing. 

The  implementation  of  SSB  filter  banks  for  k,k  channel  is 
shown  in  Fig.2(a).  In  this  way,  several  efficient  methods  to 
realize  GDFT  filter  banks  can  be  directly  applied  to  the  re¬ 


alization  of  SSB  filter  banks.  In  this  paper,  we  employ  the 
efficient  polyphase  implementation  of  the  GDFT  filter  banks 
proposed  by  Weiss  in  [5],  which  is  suitable  for  any  integer 
decimation  factor  N  <  K/2. 


Fig. 2(a):  Block  diagram  of  SSB  filter  bank  analyzer 
based  on  GDFT  filter  bank  for  the  k,h  channel. 


Fig. 2(b):  Spectrum  of  Prototype  Filter, 
k,i,  channel  of  GDFT  Analysis  Filter  Bank  and  SSB  Analysis  Filter  Bank 


C.  Weight  Transform 


Originally,  Morgan  suggested  a  FFT-stacking  transform 
for  complex-valued  DSAF  [2].  But  when  it  is  interpreted  as  a 
synthesis  filter  bank,  the  nulls  in  the  passbands  of  the  syn¬ 
thesis  filters  deteriorate  the  performance.  A  FFT-2  weight 
transform  presented  in  [6]  showed  a  significant  improvement 
in  performance.  In  this  paper,  we  design  a  new  stacking  rule 
based  on  the  similar  FFT-2  weight  transform. 

In  Fig. 2(b),  observe  that  all  real-valued  subband  signals 
are  modulated  to  baseband.  This  enables  us  to  obtain  the 
relations  among  fullband  and  real-valued  subband  signals. 
The  following  equation  shows  the  frequency  stacking  rules: 


((/)^ 

K 


W{1)^{ 


0 

W{2L-iy 


I  e  [0,  L) 

I  =  L 

I  G  (L,  2L) 


where  L  is  the  length  of  the  fullband  filter,  and  is  assumed  to 
be  dividable  by  the  LCM  of  K  and  2N.  W(l)  and  Wfl)  stand 
for  the  1,1,  FFT  coefficients  of  fullband  filter  and  the  k,k  sub¬ 
band  filter  respectively.  Here  [a]  denotes  the  closest  integer 
smaller  than  a,  and  {a)i,  means  ‘a  modulus  b’ . 


III.COMPUTATIONAL  COMPLEXITY  ANALYSIS 


IV.COMPUTER  SIMULATIONS 


Our  analysis  is  based  on  the  conclusion  made  in  Section 
2.2  that  N  ~  Nc/2  holds  approximately.  According  to  [2],  the 
computational  load  required  for  a  DSAF,  both  with  real¬ 
valued  signals  and  with  complex-valued  signals,  can  be 
divided  into  four  components:  subband  filtering,  subband 
adaptive  filtering  (APA  in  this  paper),  weight  transforma¬ 
tion,  and  the  signal  path  convolution  with  fullband  filter  W. 
The  computation  complexity  of  the  last  two  components  for 
real-valued  DSAF  is  same  as  that  for  complex-valued  DSAF, 
so  we  focus  on  the  analysis  of  first  two  parts. 

Subband  eiltering:  Let  Rf  denote  the  computational 
requirement  of  complex-valued  subband  decomposition  and 
R/  that  of  real-valued  delayless  subband  decomposition.  For 
a  GDFT  analysis  filter  bank,  if  polyphase  FFT  implementa¬ 
tion  is  used,  we  need 

R;  ^{L^  +  K\og,K  +  4K)/N^  (5) 

real  multiplications  per  input  sample  [2],  where  Lp  represents 
the  length  of  prototype  filter  h(n).  For  the  corresponding 
real-valued  SSB  filter  banks,  SSB  modulation  requires  addi¬ 
tional  K/N  multiplications  per  sample.  Since  N  ~  Nc  /2,  we 
conclude  that  real-valued  SSB  filter  banks  usually  require 
twice  computations  as  GDFT  filter  banks,  i.e.  R{  ~  2  R{ . 
However,  compared  with  the  computational  savings  in  the 
second  step  (see  analysis  below),  this  overhead  is  small  and 
ignorable. 

Subband  adaptive  filtering:  When  Fast  APA  is  used 
and  the  subband  signals  are  real-valued,  the  computational 
requirement  in  this  step  is  2M+20P  real  multiplications  per 
sample  [7].  Here  M  is  the  adaptive  filter  order  and  P  is  the 
projection  order.  As  each  complex  multiplication  requires 
four  real  multiplications,  it  is  easy  to  obtain  the  computa¬ 
tional  requirement  for  real-valued  DSAF  and  complex¬ 
valued  DSAF. 


—  (2-^  +  20P)*  — 

N  N  2 

(6) 

4  1  K 

—  {2—  +  20P)*  — 

N^  2 

(7) 

Replacing  Nc  with  2N,  we  get  the  ratio  between  them: 

R2  _{2L/N  +40P)*K/2 

Rj  ~  {2LIN  +  20P)*KI2 

In  AEG  applications,  projection  order  P  is  often  set  to  a 
value  like  8  or  16,  which,  after  multiplied  by  20  or  40,  is 
much  larger  than  2L/N.  In  this  case,  P  becomes  the  dominant 
factor  in  deciding  R2.  Approximately,  we  have  7^2^  ~  2  R2. 

Note  that  from  the  above  analysis,  the  computational  load 
for  subband  APA  can  be  highly  independent  of  its  adaptive 
filter  length  when  P  is  large.  In  summary,  the  real-valued 
delayless  subband  APA  implementation  requires  less  compu¬ 
tational  power  and  is  more  efficient. 


The  impulse  response  to  model  the  echo  path  in  our 
simulations  is  a  small  room  impulse  response,  truncated  with 
1024  taps.  Consequently,  the  number  of  the  fullband  filter 
taps  L  is  chosen  as  1024.  The  sampling  frequency  is  22050 
Hz  in  the  fullband.  The  adaptations  are  performed  in  K=128 
subbands  with  a  decimation  factor  of  N=32  for  real-valued 
DSAF  and  a  decimation  factor  of  Nc=64  for  complex-valued 
DSAF.  In  addition,  we  assumed  that  there  are  no  near-end 
speech/music  signals.  Instead,  an  ambient  noise  n(n)  with  a 
fixed  SNR  of  30dB  is  added  into  the  near-end  microphone 
(see  Fig.l).  The  measure  of  Echo  Return  Loss  Enhancement 
(ERLE)  curves  are  presented  in  the  simulation  results.  ERLE 
is  defined  as 


ERLE  =  l0log^o{ 


E[y{nf] 

E[e{nf] 


(9) 


A.  ERLE  Convergence  Plots  for  White  Noise  Input  x(n) 

Eig.3  shows  the  ERLE  convergence  of  our  proposed  real¬ 
valued  delayless  subband  APA  with  white  noise  input.  Eor 
the  open-loop  scheme  (Fig. 3(a)),  the  results  for  both  com¬ 
plex-valued  delayless  subband  RLS  algorithm  and  complex¬ 
valued  delayless  subband  APA  are  depicted  for  comparison 
purpose,  while  complex-valued  delayless  subband  NLMS 
and  complex-valued  delayless  subband  APA  were  compared 
in  closed-loop  scheme  (Fig. 3(b)).  Obviously,  compared  to 
complex-valued  delayless  subband  APA,  our  proposed  real¬ 
valued  delayless  subband  APA  achieves  better  performance 
in  terms  of  both  ERLE  convergence  speed  and  ERLE  level. 
The  increase  in  projection  order  P  from  2"‘'  to  4*  demon¬ 
strates  a  faster  ERLE  convergence  rate  and  closer  perform¬ 
ance  to  RLS  algorithm.  Furthermore,  the  closed-loop  scheme 
can  achieve  a  higher  echo  suppression  level  once  it  con¬ 
verges. 

B.  ERLE  Convergence  Plots  for  Real  Music  Input 

Fig.4  shows  the  performance  of  our  proposed  real-valued 
delayless  subband  APA  for  real  music  input.  Since  the  per¬ 
formance  of  our  proposed  real-valued  delayless  subband 
APA  is  very  close  to  its  complex-valued  counterpart  for  real 
music  input,  we  only  compare  it  to  the  conventional  com¬ 
plex-valued  delayless  subband  RLS  for  open-loop  scheme 
(Fig.4(a))  and  delayless  subband  NLMS  for  closed-loop 
scheme  (Fig.4(b))  respectively.  Similar  performance  im¬ 
provements  as  in  the  case  with  white  noise  are  observed. 
However,  the  performance  improvement  caused  by  increas¬ 
ing  P  is  not  so  significant. 

V.  Conclusions 

In  this  paper,  we  proposed  a  real-valued  delayless  sub¬ 
band  affine  projection  algorithm  (APA)  and  applied  it  to 
acoustic  echo  cancellation  applications.  The  system  imple- 


mentation  details  were  presented  and  the  computation  com¬ 
plexity  was  analyzed.  The  simulation  results  show  that  its 
performance,  in  terms  of  ERLE  level  and  convergence 
speed,  is  better  than  that  of  conventional  complex-valued 
DSAF.  In  addition,  the  computational  complexity  of  our 
proposed  approach  is  in  most  cases  less  than  its  complex¬ 
valued  counterpart. 
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Fig. 3(a):  ERLE  Convergence  for  White  Noise  Input  using  Open- 
Loop  Configuration:  Our  proposed  Real-Valued  Delayless  Sub¬ 
band  APA  shows  a  closer  performance  to  Conventional  Complex- 
Valued  Delayless  Subband  RLS  than  Complex-valued  case. 
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Fig. 4(a):  ERLE  Convergence  for  Real  Music  Input  using  Open- 
Loop  Configuration:  Our  proposed  Real-Valued  Delayless  Sub¬ 
band  APA  shows  a  closer  performance  to  Conventional  Complex- 
Valued  Delayless  Subband  RLS  when  P  is  increased. 
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Fig. 3(b):  ERLE  Convergence  for  White  Noise  Input  using  Closed- 
Loop  Configuration:  Our  proposed  Real-Valued  Delayless  Sub¬ 
band  APA  shows  a  better  performance  than  Conventional  Com¬ 
plex-valued  Delayless  Subband  NLMS/APA. 


Fig. 4(b):  ERLE  Convergence  for  Real  Music  Input  using  Closed- 
Loop  Configuration:  Our  proposed  Real-Valued  Delayless  Sub¬ 
band  APA  shows  a  better  performance  than  Conventional  Com¬ 
plex-Valued  Delayless  Subband  NLMS  for  real  music  input. 


